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SUMMARY 

NANOG is a divergent homeobox protein and a core component of the transcriptional circuitry that sustains pluripotency and 
self-renev\^aL Although NANOG has been extensively studied on the transcriptional level, little is knov\^n regarding its posttranslational 
regulation, likely due to its low abundance and challenging physical properties. Here, v^^e identify eleven phosphorylation sites on 
endogenous human NANOG, nine of v\^hich mapped to single amino acids. To screen for the signaling molecules that impart these 
modifications, we developed the multiplexed assay for kinase specificity (MAKS). MAKS simultaneously tests activity for up to ten kinases 
while directly identifying the substrate and exact site of phosphorylation. Using MAKS, we discovered site-specific phosphorylation 
by ERK2 and CDKl/CyclinA2, providing a putative link between key signaling pathways and NANOG. 



INTRODUCTION 

Phosphorylation is a pervasive form of cell signaling that 
orchestrates numerous processes, including metabolism, 
cell mobility, cell cycle, and differentiation (Brumbaugh 
et al, 2011; Van Hoof et al, 2009; Xu and Fisher, 2012). 
Mass spectrometry has revealed the complexity of the phos- 
phoproteome in pluripotent cells with great detail (Muhoz 
and Heck, 2011; Phanstiel et al., 2011; Rigbolt et al., 2011; 
Swaney et al, 2009; Van Hoof et al, 2009); however, deter- 
mining the biological relevance of such data remains a 
major challenge. Mapping the kinases responsible for a 
specific phosphorylation event is instructive because it 
places that information in the context of signaling mole- 
cules that direct biological function. Traditionally, kinase 
assays are performed by detecting the transfer of a radioac- 
tive phosphoryl group to a given substrate following an 
in vitro reaction. This method provides a direct measure 
of phosphorylation but necessitates the use of hazardous 
materials, cannot directly localize phosphorylation to a sin- 
gle amino acid when more than one potential site is present, 
and cannot multiplex kinases and substrates. Recently, a 
mass-spectrometric method was introduced to profile phos- 
phorylation on synthetic peptides treated with cell lysates 
(Yu et al., 2009). This method is ideal for profiling cell- 
type-specific phosphorylation, but does not directly deter- 
mine the kinase responsible for phosphorylation. Another 
method assesses kinase activity but relies upon heavy 



isotope-labeled amino acids and is limited to testing one 
or two kinases at a time (Singh et al., 2012b). Several other 
methods have been developed to identify kinase consensus 
motifs or test a single kinase, but are not capable of multi- 
plexed analysis (Hennrich et al., 2013; Kettenbach et al., 
2012; Songyang et al, 1994; Xue et al, 2012). Thus, there 
remains a pressing need for a high-throughput method to 
screen for kinase(s) that phosphorylate a protein of interest. 

In pluripotent cells, phosphorylation has a central role 
in directing cell identity by relaying growth-factor sig- 
naling through key pathways (i.e., fibroblast growth factor 
[FGF] and transforming growth factor p [TGF-3]) (Chen 
et al., 2011; Singh et al., 2012a; Vallier et al, 2005; Yu 
et al., 2011). The ultimate targets of these phosphorylation 
cascades are largely unknown, although recent works have 
provided some direction by mapping phosphorylation on 
the pluripotency factors OCT4 and SOX2 (Brumbaugh 
et al, 2012; Jeong et al, 2010; Phanstiel et al, 2011). 
Conspicuously absent in these studies is NANOG, a diver- 
gent homeobox transcription factor that promotes plurip- 
otency by binding to DNA and regulating the expression 
of genes related to cell fate (Boyer et al., 2005; Chambers 
et al, 2003; Mitsui et al, 2003; Pan and Thomson, 2007). 
In mice, overexpression of NANOG permits extended 
culture of undifferentiated embryonic stem cells (ESCs) 
in the absence of otherwise obligatory extrinsic signaling 
factors such as LIF and BMP4 (Chambers et al., 2003; 
Pan and Thomson, 2007). Correspondingly, NANOG 



1 8 Stem Cell Reports \ Vol. 2 | 1 8-25 | January 1 4, 20 1 4 | ©20 1 4 The Authors 



CrossMark 



Stem Cell Reports 

Phosphorylation of Human NANOG 



Phosphorylation site 


Localized 


Putative 
ERKsite? 


Enzyme 


Ser22 or Ser23 


No 


Yes 


GluC 


ThrSO 


Yes 


No 


GluC 


Ser52 


Yes 


Yes 


GluC 


Ser56 


Yes 


No 


GluC 


Ser57 


Yes 


No 


GluC 


Ser65 


Yes 


Yes 


GluC 


Ser68 or Ser69 


No 


No 


GluC 


Thr70 


Yes 


No 


GluC 


Ser71 


Yes 


Yes 


GluC 


Ser79 


Yes 


No 


Trypsin 


Ser258 


Yes 


Yes 


GluC 



|Trypsin QgIuC |~|Both 



Contribution to sequence 

coverage Phosphorylation IDs 




TVf|p|L|PSf(Vlp[L[L|Qpf[Pp^pT^[PKp^^ 



Vbo yn Vis y^ y^ y^ y2oyi9 yis yu y^e y,5 y,4 y,3 yn yio ys y? ys 

-x3 1 I x3- 

Vn'^ SMDLLIQDSP 
PSSMDLLIQD 

yjJ [y,,-H3P0/+ 



NL: 2.72E5 




[y^g-H3P0J+ 

Vie 



[y^g-HjPO/ 

[y.3-H3P0//'/y--^^°]^ 



m/z 400 600 800 1000 1200 1400 1600 1800 2000 2200 2400 



|Sequence coverage (^PO^site Putative ERK PP. site 



W-repeat— 



Figure 1. Mass Spectrometry Identifies Phosphorylation on Human NANOG 

(A) NANOG phosphorylation sites. 

(B) The contribution to either sequence coverage or phosphorylation site identification for sampLes digested with trypsin or GLuC. 

(C) Representative spectrum showing unambiguous Localization of phosphorylation on Ser71. Neutral Loss (asterisk) and internal frag- 
ments are annotated. 

(D) Schematic of NANOG with functional domains. Question marks denote phosphorylation sites that were indistinguishable between 
adjacent residues. 

See also Figures SI and S2, and Table SI. 



overexpression in human ESCs obviates the requirement 
for exogenous signaling through feeder cells in basal media 
or FGF in defined culture systems (Darr et al, 2006; Xu 
et al., 2008). Hence, NANOG has a conserved role in 
mediating growth-factor signals that are critical for plurip- 
otency, and, intriguingly, its overexpression is sufficient 
to bypass these signaling pathways to maintain the ESC 
state. Still, a direct link between NANOG and the signaling 
molecules that determine cell state remains elusive. 

In mouse, NANOG protein levels are dynamic (Cham- 
bers et al., 2007), and it was recently proposed that NANOG 
stability is tied to phosphorylation (Moretto-Zita et al, 
2010). Several studies suggested that mouse NANOG exists 
as a phosphoprotein (Li et al, 2011; Moretto-Zita et al, 
2010; Yates and Chambers, 2005); however, its unique 
primary sequence and relatively low abundance make it 
difficult to purify and detect in a physiologically relevant 
context (i.e., without overexpression and in pluripotent 
cell types). As a result, there are currently no known phos- 
phorylation sites for NANOG from human pluripotent 
stem cells and only a single site for endogenous mouse 



NANOG (Li et al, 2011). To address this gap, we applied 
high-resolution mass spectrometry to show that NANOG 
is heavily phosphorylated at proline-directed sites on its 
N terminus. To place these modifications in the context 
of signaling molecules, we developed the multiplexed assay 
for kinase specificity (MAKS), and found that ERK2 and 
CDKl differentially phosphorylate NANOG in vitro. 

RESULTS 

Identification of NANOG Phosphorylation Sites 

Due to the underrepresentation of NANOG in proteomic 
data sets, we adapted a highly efficient purification strategy 
that combines high-detergent lysis with multiple forms of 
affinity chromatography (see Supplemental Experimental 
Procedures available online; Brumbaugh et al., 2012). 
Consistent with previous results, traditional digestion 
methods that utilize trypsin in preparation for mass spec- 
trometry yielded only one phosphopeptide identification 
(Figures lA and IB). Inspection of the primary sequence 
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for NANOG revealed that tryptic cleavage sites were tightly 
clustered at the center of the protein, with few sites near 
either termini, resulting in a limited number of peptides 
that were amenable to mass spectrometry (Figure SI). We 
therefore tested several alternate proteases and found that 
GluC provided superior sequence coverage (Figures IC 
and ID; Swaney et al., 2010). In total, we sequenced over 
40% of the endogenous NANOG protein and identified 
11 phosphorylation sites (<1% false discovery rate), nine 
of which localized to a single residue (Figure ID; Table 
SI). Note that the exact site of phosphorylation could not 
be distinguished between Ser22/23 and Ser68/69 by our 
stringent criteria, but were localized to adjacent residues 
in each case. Figure IC shows a representative spectrum 
with a fully annotated, localized modification; all spectra 
were manually validated to confirm each phosphorylation 
site (Figure S2). Many of the peptides identified from 
this region were multiply phosphorylated, indicating 
that endogenous NANOG is hyperphosphorylated in the 
pluripotent state. 

To place NANOG phosphorylation sites in the context of 
cell signaling, we used the Motif-X program to search for 
kinase motifs in our data set (Schwartz and Gygi, 2005). 
Intriguingly, a preponderance of NANOG's phosphory- 
lation sites were proline directed (xP) and matched the 
minimal ERK motif (5/11 sites; Figure IB) (Songyang 
et al, 1996). This finding is compelling from a biological 
standpoint because NANOG overexpression sustains plu- 
ripotency upon ERK inhibition (Xu et al., 2008) and activa- 
tion of ERK2 maintains NANOG, even in differentiating 
cells (Yu et al., 2011). Further computational analysis using 
the NetPhorest platform substantiated the hypothesis 
that ERK2 phosphorylates these sites, but also pointed to 
CDKl as a possible modifying enzyme (Miller et al., 
2008). Note that these kinases share similar motifs, and 
the importance of applying both experimental and com- 
putational methods to determine kinase-substrate rela- 
tionships has been documented (Linding et al., 2007). 
Thus, it is important to directly screen for the kinases 
that modify NANOG. 

MAKS 

To directly identify kinases that target specific residues on 
NANOG, we developed a mass-spectrometric approach 
called MAKS. This in vitro assay leverages isobaric tagging 
technology to monitor the activity of up to ten kinases in 
a single experiment. In the first step, candidate kinases, 
ATP, and reaction buffer are added to purified protein (s) 
in separate but otherwise identical reactions (Figure 2A). 
Following incubation, each sample is digested in prepara- 
tion for mass spectrometry and labeled with a different 
isobaric tag. This tag imparts a unique chemical signature 
to every peptide, linking it to the kinase-substrate reaction 



from which it originated. Samples are combined and sub- 
jected to quantitative tandem mass spectrometry to 
directly identify peptide sequences and sites of phosphory- 
lation. During peptide fragmentation, the isobaric tags are 
cleaved, generating reporter ions that are detected in the 
low-mass region of tandem mass spectra (Figure 2B). The 
intensity of these reporter ions functions as a readout 
for the abundance of phosphorylated peptides from the 
corresponding kinase reaction. 

MAKS Reveals Site-Specific Phosphorylation by ERK2 
and CDKl on NANOG 

We selected AKTl, AKT2, CDKl/CyclinA2, CDK5/p35, 
ERK2, and PKA as candidate kinases for NANOG modifica- 
tion. We chose these kinases based primarily on motif anal- 
ysis for phosphorylation sites on NANOG (ERK2 and 
CDKl/5), but also considered kinases involved in pluripo- 
tency and differentiation (AKTl/2 and PKA). This selection 
of kinases also covered candidates from both the AGC 
and CMCG kinase families. All kinases were active and 
capable of phosphorylating numerous control substrates 
(Table S2). We therefore performed our in vitro assay by 
incubating each kinase with full-length, recombinant 
NANOG. Following mass spectrometry, reporter ion inten- 
sities were mean normalized and adjusted using nonphos- 
phorylated peptides to correct for pipetting error and 
sample loss (see Experimental Procedures; Figure S3 A). In 
total, we quantified five phosphorylation sites, all of which 
corresponded to modifications on the endogenous protein 
(Figures 3 and S3B). As shown in Figure 3, ERK2 demon- 
strated a clear preference for Ser52, a putative ERK site 
that was also identified on the endogenous protein (Fig- 
ure 3B). The abundance of phosphorylation for Ser52 was 
5 -fold higher for the ERK2 channel than for the next- 
closest channel. Interestingly, our data also showed that 
Ser71, another putative ERK site identified on endogenous 
NANOG, was instead preferentially phosphorylated by 
CDKl/CyclinA2 (Figure 3C), while Ser65 was modified 
by both ERK2 and CDKl/CyclinA2 (Figure S3B). This 
demonstrates the remarkable specificity achieved through 
this simple assay and also illustrates the importance of 
determining phosphorylation for individual sites. Phos- 
phorylation at Ser23 and Ser258 was similar across all 
kinases (Figures 3D and S3B), suggesting that (1) a different 
kinase is responsible for phosphorylation at these sites, (2) 
cof actors not included in this assay are required for 
specificity, or (3) these sites can be phosphorylated by 
a variety of kinases. Importantly, this shows that the 
specificity observed at Ser52 and SerZl is not the result of 
higher activity for the respective kinases. Phosphorylation 
sites that were identified on the endogenous protein 
but not quantified in the MAKS assay may be subject to 
regulation by kinases that were not included in our panel. 
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Figure 2. A Simplified Schematic for MAKS 

(A) Identical protein samples are treated separately with kinases. Labeled, and mixed prior to analysis. A six-plex experiment is shown, but 
ten samples can be simultaneously assayed. 

(B) Peptide identity and phosphorylation site localization are determined using tandem mass spectrometry. Reporter tags are cleaved 
during tandem mass spectrometry and appear in the low-mass range (liftout). The relative intensity for each reporter tag is directly 
proportional to the amount of phosphorylation contributed by the corresponding kinase. 



or may require additional cofactors and/or subcellular 
localization. 



DISCUSSION 

Together with OCT4 and SOX2, NANOG functions as a key 
transcriptional regulator in pluripotency. To date, almost 
no information is available regarding posttranslational 
regulation of NANOG. Our data show that NANOG is 
multiply phosphorylated in human ESCs. Nine of 11 sites 
were observed in a serine/threonine-rich patch from resi- 
dues 50-79, suggesting that this region may serve as a 
regulatory cluster. In support of this notion, recent compu- 
tational work identified a putative PEST sequence (residues 



47-72) on NANOG that overlaps greatly with the hyper- 
phosphorylated region (Figure ID; Ramakrishna et al., 
201 1). Deletion of the PEST sequence altered NANOG ubiq- 
uitination and degradation (Ramakrishna et al., 2011). 
Further, mutational analysis in mouse for sites homologous 
to Ser52, Ser65, and Ser71 suggested that phosphorylation 
of these residues stabilizes NANOG (Moretto-Zita et al., 
2010). All of these results point to a role for this region in 
regulating NANOG protein levels. Our data provide direct 
evidence that these regulatory sites are phosphorylated 
on human NANOG and in a physiological context. The 
function of phosphorylation at Ser22/23 and Ser258 is 
currently unclear; however, Ser258 is located in the CD2 
activation domain and may modulate NANOG's ability to 
activate transcription (Do et al., 2009). Further work will 
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Figure 3. ERK2 and CDKl Specifically Phosphorylate Different Sites on NANOG 

(A) Mass spectrum from a typical kinase assay with Ser52 phosphorylation. Neutral Loss (asterisk) is annotated. The Liftout shows a zoomed 
view of the reporter ion region. 

(B) ERK2 preferentially phosphoryLates Ser52. 

(C) CDKl specifically phosphorylates Ser71. 

(D) No kinase tested shows a preference for Ser258. Bar graphs represent the normalized abundance of at least three PSMs. Error bars 
represent the SD calculated from at least three PSMs across three independent experiments. The dashed line indicates the level that would 
be expected if the contribution for all channels were equal. 

See also Figure S3 and Table S2. 



be needed to clarify the exact role of each modification 
identified here. 

For years, FGF has been a requisite component in human 
pluripotent stem cell media (Chen et al, 2011; Ludwig 
et al; 2006), and yet the mechanism by which FGF 
supports pluripotency was unknown. Our data show that 
ERK2, a downstream effector in the FGF pathway, directly 



phosphorylates NANOG at Ser52. A recent report demon- 
strated that phosphorylation of a homologous site on 
mouse NANOG facilitated interaction with Pinl, a prolyl 
isomerase that inhibits ubiquitination and subsequent 
proteosomal degradation by inducing conformational 
changes (Moretto-Zita et al., 2010). Together, these data 
suggest that ERK2 phosphorylation positively regulates 
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NANOG by increasing its stability. The interplay between 
ERK and CDK signaling is also interesting in the context 
of pluripotency Our data show that these kinases phos- 
phorylate similar sequences on NANOG that are separated 
by only 19 amino acids. This result raises the possibility 
that CDKl functions analogously to ERK, perhaps to sus- 
tain NANOG during S/M phase; however, this is only 
speculative at this point. 

MAKS provides a multiplexed strategy for simulta- 
neously screening multiple kinases and protein substrates. 
A major advantage of this assay is its capacity to directly 
assess kinase activity for individual phosphorylation sites. 
In the case of NANOG, this feature enabled us to dissect 
the specificity of ERK2 and CDKl/CyclinA2, even on 
nearby sites. This specificity may result from three-dimen- 
sional protein structure or cooperation with CyclinA2, 
although it is also possible that the primary sequence 
alone is sufficient to direct kinase activity. In this regard, 
a further benefit of MAKS is that it allows one to carry 
out kinase assays using whole proteins as a substrate. 
This preserves a potential phosphorylation site in the 
context of adjacent residues or protein structure. Other 
cofactors, subcellular localization, or additional modi- 
fications may be necessary to confer specificity. MAKS 
provides a means to test the role of many of these addi- 
tional components, which can be added or removed 
from the kinase reaction to determine their effect. The 
multiplexing capacity is only limited by the number of 
isobaric tags available from the manufacturer (currently 
ten tags are commercially available), and because sub- 
strates are directly identified via mass spectrometry, 
multiple proteins of interest can be analyzed in a single 
reaction. 

We note that the MAKS assay applies relative quantita- 
tion and therefore requires careful selection of candidate 
kinases. In some instances, for example, a physiological 
kinase for a given substrate may be absent from the panel 
tested. In such cases, MAKS will identify the kinase 
that exhibits the highest activity for a given substrate, if 
indeed there is a preference among the kinases tested. 
Computational methods will be critical for selecting the 
most likely candidate kinases, and testing a wide variety 
of kinases should help in identifying the most relevant 
kinases. To this end, we are currently working to couple 
kinase panels to automated mass spectrometry to increase 
the throughput of the MAKS assay and expand the 
number of kinases that can be tested to hundreds of 
kinases. The MAKS approach should also be applicable 
to any protein-modifying enzyme (i.e., acetylase, methyl- 
transf erase, etc.). Together, the simplicity and multiplex- 
ing capabilities of this assay make MAKS widely applicable 
in screening for enzymes that modify a substrate of 
interest. 



EXPERIMENTAL PROCEDURES 

Kinase Assays 

Kinase assays were carried out in 25 mM MOPS, pH 7.2, 12.5 mM 
beta-glycerol-phosphate, 25 mM MgCh, 5 mM EGTA, 2 mM EDTA, 
1 mM dithiothreitol (DTT), and 0.1 mM ATP. Approximately 1 jig 
of recombinant NANOG and 0.25 of AKTl, AKT2, CDKl, 
CDK5, or ERK2 (all from Promega), or 2.5 U of PKA (Sigma) were 
added for 30 min at 23° C. For control reactions, 0.5 [ig of 
maltose-binding protein (MBP) and histone HI (SignalChem) 
were added in place of NANOG. The reactions were then treated 
with 5 mM DTT at 37°C for 30 min, 10 mM iodoacetamide in 
the dark at 23°C for 30 min, and 10 mM DTT at 23°C for 30 min. 
Proteins were digested with GluC or chymotrypsin (both 
from Roche Diagnostics) at an enzyme-to-protein ratio of 1:200, 
purified by solid-phase extraction, and labeled with TMT 
isobaric tags (Thermo-Fisher) according to the manufacturer's 
recommendations. 

Data analysis 

Phosphosite localization was evaluated using Phosphinator soft- 
ware (Swaney et al., 2009). This algorithm processes the theoret- 
ical fragment ion m/z ratios for all possible permutations of 
phosphopeptide isoforms based on sequence and the number 
of phosphorylation sites. It then compares each experimental 
spectrum to the theoretical product ions using a product mass 
tolerance of + 0.02 Th. Phosphinator determines the significance 
of observed product ions that designate phosphorylation at a 
given amino acid. A localized site must have a p value of <0.05 
for consideration. Further details about this algorithm were previ- 
ously described (Swaney et al., 2009). Each localized phosphosite 
reported by Phosphinator was manually validated. Other high- 
scoring spectra with unlocalized phosphosites (as reported by 
Phosphinator) were manually interrogated to determine whether 
localization could be achieved. Theoretical product ions from 
other candidate phospho-isoforms were generated using the MS 
Product program in Protein Prospector (http://prospector.ucsf. 
edu/prospector/cgi-bin/msform.cgi?form=msproduct), and locali- 
zation and spectral chimerism were evaluated (Clauser et al., 
1999). In all cases, product ions were required to be within 15 
ppm of the theoretical m/z with a signal-to-noise ratio > 3 to 
be used for sequence assignment. Both neutral loss product 
ions and internal fragments were considered during manual 
validation in addition to standard b- and y-type ions (HCD) 
and c- and z-type product ions (ETD). 

TMT isotope purity corrections were applied using the TagQuant 
program in COMPASS (Wenger et al., 2011). Purity-corrected tag 
intensities were mean normalized for all peptide spectral matches 
(PSMs). We then took advantage of £. coli proteins that copurified 
with recombinant NANOG to further correct for pipetting error 
and sample loss. Briefly, E. coli PSMs were filtered to eliminate 
any peptide hit that contained a potential phosphorylation site 
(STY) or exhibited greater than 25% interference. The values 
for each channel were then averaged to establish the mean bias 
for that channel. Dividing one by each mean bias generated 
correction factors, which were used to normalize the quantitative 
values for each channel. 
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SUPPLEMENTAL INFORMATION 

Supplemental Information includes Supplemental Experimental 
Procedures, three figures, and two tables and can be found 
with this article online at http://dx.doi.0rg/lO.lOl6/j.stemcr. 
2013.12.005. 
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